Overview
Brought to you by YData
Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 346 |
| Missing cells | 1045 |
| Missing cells (%) | 7.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 151.5 KiB |
| Average record size in memory | 448.3 B |
Variable types
| Numeric | 15 |
|---|---|
| DateTime | 4 |
| Categorical | 21 |
Signet_Ring has constant value "1.0" | Constant |
AJCC_Substage is highly overall correlated with LN_Positive and 2 other fields | High correlation |
CEA_PreOp is highly overall correlated with Log_CEA_PreOp and 1 other fields | High correlation |
Chart_No is highly overall correlated with Patient_ID | High correlation |
DFS_Months is highly overall correlated with Death and 2 other fields | High correlation |
Death is highly overall correlated with DFS_Months and 3 other fields | High correlation |
Death_Cause is highly overall correlated with Death and 1 other fields | High correlation |
Differentiation is highly overall correlated with Op_Procedure | High correlation |
Dx_Year is highly overall correlated with OS_Months and 1 other fields | High correlation |
Histology is highly overall correlated with Mucinous_Any and 1 other fields | High correlation |
LNR is highly overall correlated with LN_Positive and 1 other fields | High correlation |
LN_Positive is highly overall correlated with AJCC_Substage and 2 other fields | High correlation |
Log_CEA_PreOp is highly overall correlated with CEA_PreOp and 2 other fields | High correlation |
Mucinous_Any is highly overall correlated with Histology and 1 other fields | High correlation |
Mucinous_Gt_50 is highly overall correlated with Histology and 1 other fields | High correlation |
OS_Months is highly overall correlated with DFS_Months and 2 other fields | High correlation |
Op_Procedure is highly overall correlated with Differentiation and 2 other fields | High correlation |
Patient_ID is highly overall correlated with Chart_No and 1 other fields | High correlation |
Recurrence is highly overall correlated with DFS_Months and 4 other fields | High correlation |
Recurrence_Type is highly overall correlated with CEA_PreOp and 2 other fields | High correlation |
Tumor_Deposits is highly overall correlated with pN_Stage | High correlation |
Tumor_Location is highly overall correlated with Op_Procedure and 1 other fields | High correlation |
Tumor_Location_Group is highly overall correlated with Op_Procedure and 1 other fields | High correlation |
pN_Stage is highly overall correlated with AJCC_Substage and 3 other fields | High correlation |
pT_Stage is highly overall correlated with AJCC_Substage | High correlation |
Histology is highly imbalanced (73.3%) | Imbalance |
Differentiation is highly imbalanced (74.6%) | Imbalance |
Tumor_Deposits is highly imbalanced (69.3%) | Imbalance |
Mucinous_Gt_50 is highly imbalanced (69.3%) | Imbalance |
Mucinous_Any is highly imbalanced (53.7%) | Imbalance |
MSI_Status is highly imbalanced (61.4%) | Imbalance |
Recurrence_Type is highly imbalanced (56.1%) | Imbalance |
BMI has 4 (1.2%) missing values | Missing |
ECOG has 18 (5.2%) missing values | Missing |
pN_Stage has 83 (24.0%) missing values | Missing |
LVI has 4 (1.2%) missing values | Missing |
Signet_Ring has 344 (99.4%) missing values | Missing |
CEA_PreOp has 6 (1.7%) missing values | Missing |
Log_CEA_PreOp has 6 (1.7%) missing values | Missing |
PreOp_Albumin has 57 (16.5%) missing values | Missing |
Recurrence_Date has 258 (74.6%) missing values | Missing |
Recurrence_Type has 258 (74.6%) missing values | Missing |
Patient_ID is uniformly distributed | Uniform |
Patient_ID has unique values | Unique |
Chart_No has unique values | Unique |
LN_Positive has 19 (5.5%) zeros | Zeros |
LNR has 19 (5.5%) zeros | Zeros |
Reproduction
| Analysis started | 2025-11-02 16:21:04.129599 |
|---|---|
| Analysis finished | 2025-11-02 16:21:09.616421 |
| Duration | 5.49 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
Patient_ID
Real number (ℝ)
High correlation Uniform Unique
| Distinct | 346 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 173.5 |
| Minimum | 1 |
|---|---|
| Maximum | 346 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 18.25 |
| Q1 | 87.25 |
| median | 173.5 |
| Q3 | 259.75 |
| 95-th percentile | 328.75 |
| Maximum | 346 |
| Range | 345 |
| Interquartile range (IQR) | 172.5 |
Descriptive statistics
| Standard deviation | 100.02583 |
|---|---|
| Coefficient of variation (CV) | 0.57651775 |
| Kurtosis | -1.2 |
| Mean | 173.5 |
| Median Absolute Deviation (MAD) | 86.5 |
| Skewness | 0 |
| Sum | 60031 |
| Variance | 10005.167 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.3% |
| 228 | 1 | 0.3% |
| 236 | 1 | 0.3% |
| 235 | 1 | 0.3% |
| 234 | 1 | 0.3% |
| 233 | 1 | 0.3% |
| 232 | 1 | 0.3% |
| 231 | 1 | 0.3% |
| 230 | 1 | 0.3% |
| 229 | 1 | 0.3% |
| Other values (336) | 336 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 346 | 1 | |
| 345 | 1 | |
| 344 | 1 | |
| 343 | 1 | |
| 342 | 1 | |
| 341 | 1 | |
| 340 | 1 | |
| 339 | 1 | |
| 338 | 1 | |
| 337 | 1 |
Chart_No
Real number (ℝ)
High correlation Unique
| Distinct | 346 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12796041 |
| Minimum | 170832 |
|---|---|
| Maximum | 19350595 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 170832 |
|---|---|
| 5-th percentile | 1476152.8 |
| Q1 | 9185287.5 |
| median | 14975276 |
| Q3 | 17422205 |
| 95-th percentile | 18510017 |
| Maximum | 19350595 |
| Range | 19179763 |
| Interquartile range (IQR) | 8236917.2 |
Descriptive statistics
| Standard deviation | 5711977.8 |
|---|---|
| Coefficient of variation (CV) | 0.44638634 |
| Kurtosis | -0.6133524 |
| Mean | 12796041 |
| Median Absolute Deviation (MAD) | 2951235.5 |
| Skewness | -0.8577656 |
| Sum | 4.4274301 × 109 |
| Variance | 3.2626691 × 1013 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 170832 | 1 | 0.3% |
| 16802043 | 1 | 0.3% |
| 16981616 | 1 | 0.3% |
| 16962917 | 1 | 0.3% |
| 16934386 | 1 | 0.3% |
| 16926231 | 1 | 0.3% |
| 16843246 | 1 | 0.3% |
| 16829597 | 1 | 0.3% |
| 16825506 | 1 | 0.3% |
| 16813675 | 1 | 0.3% |
| Other values (336) | 336 |
| Value | Count | Frequency (%) |
| 170832 | 1 | |
| 190783 | 1 | |
| 335615 | 1 | |
| 458173 | 1 | |
| 536710 | 1 | |
| 545620 | 1 | |
| 657589 | 1 | |
| 706865 | 1 | |
| 790078 | 1 | |
| 826302 | 1 |
| Value | Count | Frequency (%) |
| 19350595 | 1 | |
| 19332242 | 1 | |
| 19277828 | 1 | |
| 19244963 | 1 | |
| 19234425 | 1 | |
| 19219706 | 1 | |
| 19161821 | 1 | |
| 19127334 | 1 | |
| 19114886 | 1 | |
| 19070510 | 1 |
Dx_Date
Date
| Distinct | 304 |
|---|---|
| Distinct (%) | 87.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 KiB |
| Minimum | 2017-01-14 00:00:00 |
|---|---|
| Maximum | 2022-01-07 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Dx_Year
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 KiB |
| 2021 | |
|---|---|
| 2020 | |
| 2019 | |
| 2018 | |
| 2017 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2017 |
|---|---|
| 2nd row | 2017 |
| 3rd row | 2021 |
| 4th row | 2021 |
| 5th row | 2020 |
Common Values
| Value | Count | Frequency (%) |
| 2021 | 80 | |
| 2020 | 78 | |
| 2019 | 76 | |
| 2018 | 63 | |
| 2017 | 49 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2021 | 80 | |
| 2020 | 78 | |
| 2019 | 76 | |
| 2018 | 63 | |
| 2017 | 49 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 504 | |
| 0 | 424 | |
| 1 | 268 | |
| 9 | 76 | 5.5% |
| 8 | 63 | 4.6% |
| 7 | 49 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1384 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 504 | |
| 0 | 424 | |
| 1 | 268 | |
| 9 | 76 | 5.5% |
| 8 | 63 | 4.6% |
| 7 | 49 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1384 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 504 | |
| 0 | 424 | |
| 1 | 268 | |
| 9 | 76 | 5.5% |
| 8 | 63 | 4.6% |
| 7 | 49 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1384 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 504 | |
| 0 | 424 | |
| 1 | 268 | |
| 9 | 76 | 5.5% |
| 8 | 63 | 4.6% |
| 7 | 49 | 3.5% |
Age
Real number (ℝ)
| Distinct | 61 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65.419075 |
| Minimum | 23 |
|---|---|
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 44 |
| Q1 | 56 |
| median | 66 |
| Q3 | 76 |
| 95-th percentile | 85 |
| Maximum | 98 |
| Range | 75 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.172021 |
|---|---|
| Coefficient of variation (CV) | 0.20134832 |
| Kurtosis | -0.42762385 |
| Mean | 65.419075 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -0.15286429 |
| Sum | 22635 |
| Variance | 173.50213 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 14 | 4.0% |
| 70 | 12 | 3.5% |
| 67 | 11 | 3.2% |
| 81 | 11 | 3.2% |
| 78 | 11 | 3.2% |
| 64 | 11 | 3.2% |
| 50 | 11 | 3.2% |
| 68 | 10 | 2.9% |
| 66 | 10 | 2.9% |
| 60 | 10 | 2.9% |
| Other values (51) | 235 |
| Value | Count | Frequency (%) |
| 23 | 1 | 0.3% |
| 31 | 1 | 0.3% |
| 32 | 1 | 0.3% |
| 36 | 1 | 0.3% |
| 37 | 1 | 0.3% |
| 38 | 2 | 0.6% |
| 39 | 1 | 0.3% |
| 41 | 1 | 0.3% |
| 42 | 6 | |
| 43 | 2 | 0.6% |
| Value | Count | Frequency (%) |
| 98 | 1 | 0.3% |
| 94 | 1 | 0.3% |
| 92 | 1 | 0.3% |
| 91 | 1 | 0.3% |
| 90 | 2 | 0.6% |
| 89 | 2 | 0.6% |
| 88 | 5 | |
| 87 | 2 | 0.6% |
| 86 | 2 | 0.6% |
| 85 | 6 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 199 | |
| 2 | 147 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 199 | |
| 2 | 147 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 199 | |
| 2 | 147 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 199 | |
| 2 | 147 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 199 | |
| 2 | 147 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 199 | |
| 2 | 147 |
BMI
Real number (ℝ)
Missing
| Distinct | 304 |
|---|---|
| Distinct (%) | 88.9% |
| Missing | 4 |
| Missing (%) | 1.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.861725 |
| Minimum | 0 |
|---|---|
| Maximum | 60.61 |
| Zeros | 1 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 17.142 |
| Q1 | 20.7725 |
| median | 23.375 |
| Q3 | 26.4925 |
| 95-th percentile | 31.7295 |
| Maximum | 60.61 |
| Range | 60.61 |
| Interquartile range (IQR) | 5.72 |
Descriptive statistics
| Standard deviation | 5.0006798 |
|---|---|
| Coefficient of variation (CV) | 0.20956908 |
| Kurtosis | 9.3192761 |
| Mean | 23.861725 |
| Median Absolute Deviation (MAD) | 2.855 |
| Skewness | 1.2011512 |
| Sum | 8160.71 |
| Variance | 25.006798 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23.39 | 3 | 0.9% |
| 20.17 | 3 | 0.9% |
| 23.03 | 3 | 0.9% |
| 20.65 | 2 | 0.6% |
| 23.16 | 2 | 0.6% |
| 21.91 | 2 | 0.6% |
| 22.07 | 2 | 0.6% |
| 28.41 | 2 | 0.6% |
| 21.54 | 2 | 0.6% |
| 26.94 | 2 | 0.6% |
| Other values (294) | 319 | |
| (Missing) | 4 | 1.2% |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 13.79 | 1 | |
| 14.15 | 1 | |
| 14.18 | 1 | |
| 14.89 | 1 | |
| 15.02 | 1 | |
| 15.06 | 1 | |
| 15.22 | 1 | |
| 15.42 | 1 | |
| 15.48 | 1 |
| Value | Count | Frequency (%) |
| 60.61 | 1 | |
| 39 | 1 | |
| 38.32 | 1 | |
| 36.27 | 1 | |
| 36.25 | 1 | |
| 35.38 | 1 | |
| 35.36 | 1 | |
| 34.72 | 1 | |
| 34.28 | 1 | |
| 34.21 | 1 |
ECOG
Categorical
Missing
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 18 |
| Missing (%) | 5.2% |
| Memory size | 17.8 KiB |
| 1.0 | |
|---|---|
| 0.0 | |
| 2.0 | |
| 3.0 | 8 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 214 | |
| 0.0 | 84 | 24.3% |
| 2.0 | 22 | 6.4% |
| 3.0 | 8 | 2.3% |
| (Missing) | 18 | 5.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 214 | |
| 0.0 | 84 | 25.6% |
| 2.0 | 22 | 6.7% |
| 3.0 | 8 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 412 | |
| . | 328 | |
| 1 | 214 | |
| 2 | 22 | 2.2% |
| 3 | 8 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 984 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 412 | |
| . | 328 | |
| 1 | 214 | |
| 2 | 22 | 2.2% |
| 3 | 8 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 984 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 412 | |
| . | 328 | |
| 1 | 214 | |
| 2 | 22 | 2.2% |
| 3 | 8 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 984 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 412 | |
| . | 328 | |
| 1 | 214 | |
| 2 | 22 | 2.2% |
| 3 | 8 | 0.8% |
Tumor_Location
Real number (ℝ)
High correlation
| Distinct | 8 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.5231214 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 7 |
| 95-th percentile | 8 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.4603022 |
|---|---|
| Coefficient of variation (CV) | 0.44545503 |
| Kurtosis | -1.1752129 |
| Mean | 5.5231214 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.70981314 |
| Sum | 1911 |
| Variance | 6.053087 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 139 | |
| 8 | 68 | |
| 2 | 66 | |
| 1 | 22 | 6.4% |
| 4 | 21 | 6.1% |
| 6 | 20 | 5.8% |
| 3 | 7 | 2.0% |
| 5 | 3 | 0.9% |
| Value | Count | Frequency (%) |
| 1 | 22 | 6.4% |
| 2 | 66 | |
| 3 | 7 | 2.0% |
| 4 | 21 | 6.1% |
| 5 | 3 | 0.9% |
| 6 | 20 | 5.8% |
| 7 | 139 | |
| 8 | 68 |
| Value | Count | Frequency (%) |
| 8 | 68 | |
| 7 | 139 | |
| 6 | 20 | 5.8% |
| 5 | 3 | 0.9% |
| 4 | 21 | 6.1% |
| 3 | 7 | 2.0% |
| 2 | 66 | |
| 1 | 22 | 6.4% |
Tumor_Location_Group
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 2 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 227 | |
| 1 | 119 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 227 | |
| 1 | 119 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 227 | |
| 1 | 119 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 227 | |
| 1 | 119 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 227 | |
| 1 | 119 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 227 | |
| 1 | 119 |
pT_Stage
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 902.0 B |
| 3 | |
|---|---|
| 4A | |
| 2 | |
| 1 | 15 |
| 4B | 14 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.1676301 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4A |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 4A |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 243 | |
| 4A | 44 | 12.7% |
| 2 | 30 | 8.7% |
| 1 | 15 | 4.3% |
| 4B | 14 | 4.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 243 | |
| 4a | 44 | 12.7% |
| 2 | 30 | 8.7% |
| 1 | 15 | 4.3% |
| 4b | 14 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 243 | |
| 4 | 58 | 14.4% |
| A | 44 | 10.9% |
| 2 | 30 | 7.4% |
| 1 | 15 | 3.7% |
| B | 14 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 404 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 243 | |
| 4 | 58 | 14.4% |
| A | 44 | 10.9% |
| 2 | 30 | 7.4% |
| 1 | 15 | 3.7% |
| B | 14 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 404 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 243 | |
| 4 | 58 | 14.4% |
| A | 44 | 10.9% |
| 2 | 30 | 7.4% |
| 1 | 15 | 3.7% |
| B | 14 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 404 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 243 | |
| 4 | 58 | 14.4% |
| A | 44 | 10.9% |
| 2 | 30 | 7.4% |
| 1 | 15 | 3.7% |
| B | 14 | 3.5% |
pN_Stage
Categorical
High correlation Missing
| Distinct | 3 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 83 |
| Missing (%) | 24.0% |
| Memory size | 856.0 B |
| 1B | |
|---|---|
| 1A | |
| 2B |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2B |
|---|---|
| 2nd row | 1B |
| 3rd row | 1B |
| 4th row | 1B |
| 5th row | 1B |
Common Values
| Value | Count | Frequency (%) |
| 1B | 109 | |
| 1A | 97 | |
| 2B | 57 | |
| (Missing) | 83 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1b | 109 | |
| 1a | 97 | |
| 2b | 57 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 206 | |
| B | 166 | |
| A | 97 | |
| 2 | 57 | 10.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 526 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 206 | |
| B | 166 | |
| A | 97 | |
| 2 | 57 | 10.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 526 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 206 | |
| B | 166 | |
| A | 97 | |
| 2 | 57 | 10.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 526 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 206 | |
| B | 166 | |
| A | 97 | |
| 2 | 57 | 10.8% |
AJCC_Substage
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 739.0 B |
| 3B | |
|---|---|
| 3C | |
| 3A |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3C |
|---|---|
| 2nd row | 3B |
| 3rd row | 3B |
| 4th row | 3B |
| 5th row | 3B |
Common Values
| Value | Count | Frequency (%) |
| 3B | 237 | |
| 3C | 69 | 19.9% |
| 3A | 40 | 11.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3b | 237 | |
| 3c | 69 | 19.9% |
| 3a | 40 | 11.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 346 | |
| B | 237 | |
| C | 69 | 10.0% |
| A | 40 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 692 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 346 | |
| B | 237 | |
| C | 69 | 10.0% |
| A | 40 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 692 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 346 | |
| B | 237 | |
| C | 69 | 10.0% |
| A | 40 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 692 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 346 | |
| B | 237 | |
| C | 69 | 10.0% |
| A | 40 | 5.8% |
LN_Total
Real number (ℝ)
| Distinct | 43 |
|---|---|
| Distinct (%) | 12.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.291908 |
| Minimum | 9 |
|---|---|
| Maximum | 65 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 17 |
| median | 21 |
| Q3 | 28 |
| 95-th percentile | 40 |
| Maximum | 65 |
| Range | 56 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 9.1666562 |
|---|---|
| Coefficient of variation (CV) | 0.39355541 |
| Kurtosis | 2.7172921 |
| Mean | 23.291908 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 1.402984 |
| Sum | 8059 |
| Variance | 84.027586 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 22 | 6.4% |
| 16 | 22 | 6.4% |
| 22 | 21 | 6.1% |
| 12 | 21 | 6.1% |
| 17 | 21 | 6.1% |
| 26 | 17 | 4.9% |
| 20 | 17 | 4.9% |
| 21 | 17 | 4.9% |
| 23 | 16 | 4.6% |
| 18 | 15 | 4.3% |
| Other values (33) | 157 |
| Value | Count | Frequency (%) |
| 9 | 1 | 0.3% |
| 12 | 21 | |
| 13 | 12 | |
| 14 | 13 | |
| 15 | 14 | |
| 16 | 22 | |
| 17 | 21 | |
| 18 | 15 | |
| 19 | 22 | |
| 20 | 17 |
| Value | Count | Frequency (%) |
| 65 | 1 | 0.3% |
| 64 | 1 | 0.3% |
| 54 | 3 | |
| 51 | 1 | 0.3% |
| 50 | 1 | 0.3% |
| 49 | 1 | 0.3% |
| 48 | 2 | |
| 47 | 1 | 0.3% |
| 45 | 1 | 0.3% |
| 44 | 1 | 0.3% |
LN_Positive
Real number (ℝ)
High correlation Zeros
| Distinct | 18 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6589595 |
| Minimum | 0 |
|---|---|
| Maximum | 32 |
| Zeros | 19 |
| Zeros (%) | 5.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2.5 |
| Q3 | 5 |
| 95-th percentile | 11 |
| Maximum | 32 |
| Range | 32 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.7485291 |
|---|---|
| Coefficient of variation (CV) | 1.0244795 |
| Kurtosis | 11.369561 |
| Mean | 3.6589595 |
| Median Absolute Deviation (MAD) | 1.5 |
| Skewness | 2.6127072 |
| Sum | 1266 |
| Variance | 14.05147 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 96 | |
| 2 | 58 | |
| 3 | 52 | |
| 4 | 29 | 8.4% |
| 0 | 19 | 5.5% |
| 5 | 19 | 5.5% |
| 6 | 16 | 4.6% |
| 7 | 15 | 4.3% |
| 8 | 11 | 3.2% |
| 10 | 6 | 1.7% |
| Other values (8) | 25 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 19 | 5.5% |
| 1 | 96 | |
| 2 | 58 | |
| 3 | 52 | |
| 4 | 29 | 8.4% |
| 5 | 19 | 5.5% |
| 6 | 16 | 4.6% |
| 7 | 15 | 4.3% |
| 8 | 11 | 3.2% |
| 9 | 4 | 1.2% |
| Value | Count | Frequency (%) |
| 32 | 1 | 0.3% |
| 20 | 2 | 0.6% |
| 16 | 2 | 0.6% |
| 14 | 4 | 1.2% |
| 13 | 2 | 0.6% |
| 12 | 6 | |
| 11 | 4 | 1.2% |
| 10 | 6 | |
| 9 | 4 | 1.2% |
| 8 | 11 |
LNR
Real number (ℝ)
High correlation Zeros
| Distinct | 134 |
|---|---|
| Distinct (%) | 38.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.16873022 |
| Minimum | 0 |
|---|---|
| Maximum | 0.92307692 |
| Zeros | 19 |
| Zeros (%) | 5.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.055555556 |
| median | 0.11324786 |
| Q3 | 0.21428571 |
| 95-th percentile | 0.51973684 |
| Maximum | 0.92307692 |
| Range | 0.92307692 |
| Interquartile range (IQR) | 0.15873016 |
Descriptive statistics
| Standard deviation | 0.16402123 |
|---|---|
| Coefficient of variation (CV) | 0.97209163 |
| Kurtosis | 3.1422627 |
| Mean | 0.16873022 |
| Median Absolute Deviation (MAD) | 0.067793318 |
| Skewness | 1.7454062 |
| Sum | 58.380656 |
| Variance | 0.026902965 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19 | 5.5% |
| 0.07142857143 | 12 | 3.5% |
| 0.25 | 10 | 2.9% |
| 0.1666666667 | 9 | 2.6% |
| 0.05882352941 | 9 | 2.6% |
| 0.05263157895 | 8 | 2.3% |
| 0.08333333333 | 7 | 2.0% |
| 0.1428571429 | 7 | 2.0% |
| 0.1363636364 | 7 | 2.0% |
| 0.1 | 7 | 2.0% |
| Other values (124) | 251 |
| Value | Count | Frequency (%) |
| 0 | 19 | |
| 0.01960784314 | 1 | 0.3% |
| 0.02040816327 | 1 | 0.3% |
| 0.02127659574 | 1 | 0.3% |
| 0.02272727273 | 1 | 0.3% |
| 0.025 | 1 | 0.3% |
| 0.02857142857 | 2 | 0.6% |
| 0.02941176471 | 2 | 0.6% |
| 0.0303030303 | 2 | 0.6% |
| 0.03125 | 4 | 1.2% |
| Value | Count | Frequency (%) |
| 0.9230769231 | 1 | |
| 0.8 | 1 | |
| 0.7777777778 | 1 | |
| 0.7647058824 | 1 | |
| 0.7272727273 | 1 | |
| 0.7058823529 | 1 | |
| 0.6666666667 | 1 | |
| 0.6470588235 | 1 | |
| 0.6315789474 | 1 | |
| 0.625 | 1 |
Histology
Categorical
High correlation Imbalance
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 1 | |
|---|---|
| 2 | 23 |
| 3 | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 320 | |
| 2 | 23 | 6.6% |
| 3 | 3 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 320 | |
| 2 | 23 | 6.6% |
| 3 | 3 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 320 | |
| 2 | 23 | 6.6% |
| 3 | 3 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 320 | |
| 2 | 23 | 6.6% |
| 3 | 3 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 320 | |
| 2 | 23 | 6.6% |
| 3 | 3 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 320 | |
| 2 | 23 | 6.6% |
| 3 | 3 | 0.9% |
Differentiation
Categorical
High correlation Imbalance
| Distinct | 5 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 2 | |
|---|---|
| 3 | 24 |
| 4 | 5 |
| 1 | 4 |
| 9 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 312 | |
| 3 | 24 | 6.9% |
| 4 | 5 | 1.4% |
| 1 | 4 | 1.2% |
| 9 | 1 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 312 | |
| 3 | 24 | 6.9% |
| 4 | 5 | 1.4% |
| 1 | 4 | 1.2% |
| 9 | 1 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 312 | |
| 3 | 24 | 6.9% |
| 4 | 5 | 1.4% |
| 1 | 4 | 1.2% |
| 9 | 1 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 312 | |
| 3 | 24 | 6.9% |
| 4 | 5 | 1.4% |
| 1 | 4 | 1.2% |
| 9 | 1 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 312 | |
| 3 | 24 | 6.9% |
| 4 | 5 | 1.4% |
| 1 | 4 | 1.2% |
| 9 | 1 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 312 | |
| 3 | 24 | 6.9% |
| 4 | 5 | 1.4% |
| 1 | 4 | 1.2% |
| 9 | 1 | 0.3% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 174 | |
| 0.0 | 168 | |
| (Missing) | 4 | 1.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 174 | |
| 0.0 | 168 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 510 | |
| . | 342 | |
| 1 | 174 | 17.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1026 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 510 | |
| . | 342 | |
| 1 | 174 | 17.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1026 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 510 | |
| . | 342 | |
| 1 | 174 | 17.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1026 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 510 | |
| . | 342 | |
| 1 | 174 | 17.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 288 | |
| 1.0 | 55 | 15.9% |
| (Missing) | 3 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 288 | |
| 1.0 | 55 | 16.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 631 | |
| . | 343 | |
| 1 | 55 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1029 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 631 | |
| . | 343 | |
| 1 | 55 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1029 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 631 | |
| . | 343 | |
| 1 | 55 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1029 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 631 | |
| . | 343 | |
| 1 | 55 | 5.3% |
Tumor_Deposits
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 0 | |
|---|---|
| 1 | 19 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Mucinous_Gt_50
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 0 | |
|---|---|
| 1 | 19 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 327 | |
| 1 | 19 | 5.5% |
Mucinous_Any
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 312 | |
| 1 | 34 | 9.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 312 | |
| 1 | 34 | 9.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 312 | |
| 1 | 34 | 9.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 312 | |
| 1 | 34 | 9.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 312 | |
| 1 | 34 | 9.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 312 | |
| 1 | 34 | 9.8% |
Signet_Ring
Categorical
Constant Missing
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 344 |
| Missing (%) | 99.4% |
| Memory size | 19.0 KiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 2 | 0.6% |
| (Missing) | 344 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| . | 2 | |
| 0 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 2 | |
| . | 2 | |
| 0 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 2 | |
| . | 2 | |
| 0 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 2 | |
| . | 2 | |
| 0 | 2 |
MSI_Status
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 2 |
| Missing (%) | 0.6% |
| Memory size | 17.8 KiB |
| MSS | |
|---|---|
| MSI-H | 26 |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.1511628 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MSS |
|---|---|
| 2nd row | MSS |
| 3rd row | MSS |
| 4th row | MSI-H |
| 5th row | MSS |
Common Values
| Value | Count | Frequency (%) |
| MSS | 318 | |
| MSI-H | 26 | 7.5% |
| (Missing) | 2 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| mss | 318 | |
| msi-h | 26 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 662 | |
| M | 344 | |
| I | 26 | 2.4% |
| - | 26 | 2.4% |
| H | 26 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1084 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 662 | |
| M | 344 | |
| I | 26 | 2.4% |
| - | 26 | 2.4% |
| H | 26 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1084 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 662 | |
| M | 344 | |
| I | 26 | 2.4% |
| - | 26 | 2.4% |
| H | 26 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1084 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 662 | |
| M | 344 | |
| I | 26 | 2.4% |
| - | 26 | 2.4% |
| H | 26 | 2.4% |
Tumor_Size_cm
Real number (ℝ)
| Distinct | 84 |
|---|---|
| Distinct (%) | 24.4% |
| Missing | 2 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.6331395 |
| Minimum | 0.1 |
|---|---|
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 1.815 |
| Q1 | 3.2 |
| median | 4.3 |
| Q3 | 5.525 |
| 95-th percentile | 8.785 |
| Maximum | 15 |
| Range | 14.9 |
| Interquartile range (IQR) | 2.325 |
Descriptive statistics
| Standard deviation | 2.2059568 |
|---|---|
| Coefficient of variation (CV) | 0.4761257 |
| Kurtosis | 2.0694149 |
| Mean | 4.6331395 |
| Median Absolute Deviation (MAD) | 1.2 |
| Skewness | 1.113498 |
| Sum | 1593.8 |
| Variance | 4.8662455 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 18 | 5.2% |
| 4.5 | 17 | 4.9% |
| 3.5 | 16 | 4.6% |
| 3 | 13 | 3.8% |
| 4.7 | 10 | 2.9% |
| 5.5 | 10 | 2.9% |
| 6.5 | 10 | 2.9% |
| 4.3 | 10 | 2.9% |
| 3.4 | 10 | 2.9% |
| 3.2 | 9 | 2.6% |
| Other values (74) | 221 |
| Value | Count | Frequency (%) |
| 0.1 | 1 | 0.3% |
| 0.2 | 1 | 0.3% |
| 0.3 | 1 | 0.3% |
| 0.5 | 1 | 0.3% |
| 0.9 | 2 | |
| 1 | 3 | |
| 1.1 | 1 | 0.3% |
| 1.5 | 2 | |
| 1.6 | 2 | |
| 1.8 | 4 |
| Value | Count | Frequency (%) |
| 15 | 1 | 0.3% |
| 12.5 | 1 | 0.3% |
| 12 | 1 | 0.3% |
| 11.8 | 1 | 0.3% |
| 11.5 | 1 | 0.3% |
| 10.7 | 1 | 0.3% |
| 10.5 | 1 | 0.3% |
| 10 | 4 | |
| 9.8 | 1 | 0.3% |
| 9.5 | 1 | 0.3% |
CEA_PreOp
Real number (ℝ)
High correlation Missing
| Distinct | 132 |
|---|---|
| Distinct (%) | 38.8% |
| Missing | 6 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.872206 |
| Minimum | 0.5 |
|---|---|
| Maximum | 3443 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 0.5 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 1.7 |
| median | 2.95 |
| Q3 | 7.925 |
| 95-th percentile | 44.205 |
| Maximum | 3443 |
| Range | 3442.5 |
| Interquartile range (IQR) | 6.225 |
Descriptive statistics
| Standard deviation | 195.14382 |
|---|---|
| Coefficient of variation (CV) | 8.1745197 |
| Kurtosis | 281.32307 |
| Mean | 23.872206 |
| Median Absolute Deviation (MAD) | 1.75 |
| Skewness | 16.281858 |
| Sum | 8116.55 |
| Variance | 38081.109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 28 | 8.1% |
| 2.5 | 11 | 3.2% |
| 1.1 | 11 | 3.2% |
| 1.3 | 10 | 2.9% |
| 2.1 | 10 | 2.9% |
| 1.5 | 10 | 2.9% |
| 2.9 | 9 | 2.6% |
| 1.9 | 9 | 2.6% |
| 3.4 | 8 | 2.3% |
| 1.7 | 8 | 2.3% |
| Other values (122) | 226 |
| Value | Count | Frequency (%) |
| 0.5 | 28 | |
| 1 | 7 | 2.0% |
| 1.1 | 11 | 3.2% |
| 1.2 | 3 | 0.9% |
| 1.3 | 10 | 2.9% |
| 1.4 | 8 | 2.3% |
| 1.5 | 10 | 2.9% |
| 1.6 | 2 | 0.6% |
| 1.7 | 8 | 2.3% |
| 1.71 | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 3443 | 1 | |
| 914.2 | 1 | |
| 471 | 1 | |
| 181.7 | 1 | |
| 158.5 | 1 | |
| 142.1 | 1 | |
| 134.6 | 2 | |
| 128.5 | 1 | |
| 79.61 | 1 | |
| 71 | 1 |
Log_CEA_PreOp
Real number (ℝ)
High correlation Missing
| Distinct | 132 |
|---|---|
| Distinct (%) | 38.8% |
| Missing | 6 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7241891 |
| Minimum | 0.40546511 |
|---|---|
| Maximum | 8.1443889 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 0.40546511 |
|---|---|
| 5-th percentile | 0.40546511 |
| Q1 | 0.99325177 |
| median | 1.3736355 |
| Q3 | 2.1888446 |
| 95-th percentile | 3.8112076 |
| Maximum | 8.1443889 |
| Range | 7.7389238 |
| Interquartile range (IQR) | 1.1955928 |
Descriptive statistics
| Standard deviation | 1.1111104 |
|---|---|
| Coefficient of variation (CV) | 0.64442489 |
| Kurtosis | 4.9686681 |
| Mean | 1.7241891 |
| Median Absolute Deviation (MAD) | 0.51343419 |
| Skewness | 1.7840414 |
| Sum | 586.22428 |
| Variance | 1.2345662 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.4054651081 | 28 | 8.1% |
| 1.252762968 | 11 | 3.2% |
| 0.7419373447 | 11 | 3.2% |
| 0.8329091229 | 10 | 2.9% |
| 1.131402111 | 10 | 2.9% |
| 0.9162907319 | 10 | 2.9% |
| 1.360976553 | 9 | 2.6% |
| 1.064710737 | 9 | 2.6% |
| 1.481604541 | 8 | 2.3% |
| 0.993251773 | 8 | 2.3% |
| Other values (122) | 226 |
| Value | Count | Frequency (%) |
| 0.4054651081 | 28 | |
| 0.6931471806 | 7 | 2.0% |
| 0.7419373447 | 11 | 3.2% |
| 0.7884573604 | 3 | 0.9% |
| 0.8329091229 | 10 | 2.9% |
| 0.8754687374 | 8 | 2.3% |
| 0.9162907319 | 10 | 2.9% |
| 0.955511445 | 2 | 0.6% |
| 0.993251773 | 8 | 2.3% |
| 0.9969486349 | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 8.144388866 | 1 | |
| 6.819142621 | 1 | |
| 6.156978986 | 1 | |
| 5.207845463 | 1 | |
| 5.072043922 | 1 | |
| 4.963543687 | 1 | |
| 4.909709376 | 2 | |
| 4.863680881 | 1 | |
| 4.389622711 | 1 | |
| 4.276666119 | 1 |
Radical_Op_Date
Date
| Distinct | 307 |
|---|---|
| Distinct (%) | 88.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 KiB |
| Minimum | 2017-01-23 00:00:00 |
|---|---|
| Maximum | 2022-02-04 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Op_Procedure
Categorical
High correlation
| Distinct | 13 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 27.0 KiB |
| Laparoscopic anterior resection | |
|---|---|
| Laparoscopic right hemicolectomy | |
| Laparoscopic low anterior resection | |
| Laparoscopic left hemicolectomy | |
| Anterior resection | |
| Other values (8) |
Length
| Max length | 62 |
|---|---|
| Median length | 41 |
| Mean length | 30.557803 |
| Min length | 18 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | Laparoscopic right hemicolectomy |
|---|---|
| 2nd row | Anterior resection |
| 3rd row | Laparoscopic anterior resection |
| 4th row | Laparoscopic right hemicolectomy |
| 5th row | Laparoscopic anterior resection |
Common Values
| Value | Count | Frequency (%) |
| Laparoscopic anterior resection | 126 | |
| Laparoscopic right hemicolectomy | 70 | |
| Laparoscopic low anterior resection | 47 | 13.6% |
| Laparoscopic left hemicolectomy | 26 | 7.5% |
| Anterior resection | 24 | 6.9% |
| Right hemicolectomy | 20 | 5.8% |
| Laparoscopic extended right hemicolectomy | 12 | 3.5% |
| Low anterior resection | 11 | 3.2% |
| Single Incision Laparoscopic Surgery (SILS) anterior resection | 4 | 1.2% |
| Extended right hemicolectomy | 3 | 0.9% |
| Other values (3) | 3 | 0.9% |
Length
| Value | Count | Frequency (%) |
| laparoscopic | 287 | |
| anterior | 212 | |
| resection | 212 | |
| hemicolectomy | 133 | |
| right | 106 | 9.9% |
| low | 58 | 5.4% |
| left | 28 | 2.6% |
| extended | 17 | 1.6% |
| single | 4 | 0.4% |
| incision | 4 | 0.4% |
| Other values (3) | 9 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1328 | |
| c | 1058 | |
| r | 1017 | |
| e | 987 | |
| i | 963 | |
| a | 762 | 7.2% |
| 724 | 6.8% | |
| t | 709 | 6.7% |
| p | 574 | 5.4% |
| s | 503 | 4.8% |
| Other values (19) | 1948 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10573 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1328 | |
| c | 1058 | |
| r | 1017 | |
| e | 987 | |
| i | 963 | |
| a | 762 | 7.2% |
| 724 | 6.8% | |
| t | 709 | 6.7% |
| p | 574 | 5.4% |
| s | 503 | 4.8% |
| Other values (19) | 1948 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10573 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1328 | |
| c | 1058 | |
| r | 1017 | |
| e | 987 | |
| i | 963 | |
| a | 762 | 7.2% |
| 724 | 6.8% | |
| t | 709 | 6.7% |
| p | 574 | 5.4% |
| s | 503 | 4.8% |
| Other values (19) | 1948 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10573 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1328 | |
| c | 1058 | |
| r | 1017 | |
| e | 987 | |
| i | 963 | |
| a | 762 | 7.2% |
| 724 | 6.8% | |
| t | 709 | 6.7% |
| p | 574 | 5.4% |
| s | 503 | 4.8% |
| Other values (19) | 1948 |
PreOp_Albumin
Real number (ℝ)
Missing
| Distinct | 29 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 57 |
| Missing (%) | 16.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9176471 |
| Minimum | 2.1 |
|---|---|
| Maximum | 4.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 2.1 |
|---|---|
| 5-th percentile | 2.8 |
| Q1 | 3.6 |
| median | 4 |
| Q3 | 4.3 |
| 95-th percentile | 4.6 |
| Maximum | 4.9 |
| Range | 2.8 |
| Interquartile range (IQR) | 0.7 |
Descriptive statistics
| Standard deviation | 0.56137233 |
|---|---|
| Coefficient of variation (CV) | 0.14329324 |
| Kurtosis | 0.74058011 |
| Mean | 3.9176471 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.97980087 |
| Sum | 1132.2 |
| Variance | 0.31513889 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 29 | 8.4% |
| 4.2 | 27 | 7.8% |
| 4.1 | 24 | 6.9% |
| 4.3 | 24 | 6.9% |
| 4.4 | 24 | 6.9% |
| 3.9 | 21 | 6.1% |
| 4.5 | 16 | 4.6% |
| 3.8 | 15 | 4.3% |
| 3.5 | 13 | 3.8% |
| 4.6 | 11 | 3.2% |
| Other values (19) | 85 | |
| (Missing) | 57 |
| Value | Count | Frequency (%) |
| 2.1 | 2 | |
| 2.2 | 1 | 0.3% |
| 2.3 | 2 | |
| 2.4 | 2 | |
| 2.5 | 2 | |
| 2.6 | 1 | 0.3% |
| 2.7 | 3 | |
| 2.8 | 4 | |
| 2.9 | 4 | |
| 3 | 4 |
| Value | Count | Frequency (%) |
| 4.9 | 1 | 0.3% |
| 4.8 | 6 | 1.7% |
| 4.7 | 6 | 1.7% |
| 4.6 | 11 | 3.2% |
| 4.5 | 16 | |
| 4.4 | 24 | |
| 4.3 | 24 | |
| 4.2 | 27 | |
| 4.1 | 24 | |
| 4 | 29 |
Last_FU_Date
Date
| Distinct | 227 |
|---|---|
| Distinct (%) | 65.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 KiB |
| Minimum | 2017-10-21 00:00:00 |
|---|---|
| Maximum | 2024-05-04 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Recurrence
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 1 | 88 | 25.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 1 | 88 | 25.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 1 | 88 | 25.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 1 | 88 | 25.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 1 | 88 | 25.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 1 | 88 | 25.4% |
Recurrence_Date
Date
Missing
| Distinct | 87 |
|---|---|
| Distinct (%) | 98.9% |
| Missing | 258 |
| Missing (%) | 74.6% |
| Memory size | 2.8 KiB |
| Minimum | 2017-10-17 00:00:00 |
|---|---|
| Maximum | 2023-06-06 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Recurrence_Type
Categorical
High correlation Imbalance Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 258 |
| Missing (%) | 74.6% |
| Memory size | 19.1 KiB |
| Distant | |
|---|---|
| Locoregional | 8 |
Length
| Max length | 12 |
|---|---|
| Median length | 7 |
| Mean length | 7.4545455 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Locoregional |
|---|---|
| 2nd row | Distant |
| 3rd row | Distant |
| 4th row | Distant |
| 5th row | Distant |
Common Values
| Value | Count | Frequency (%) |
| Distant | 80 | 23.1% |
| Locoregional | 8 | 2.3% |
| (Missing) | 258 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| distant | 80 | |
| locoregional | 8 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 160 | |
| i | 88 | |
| a | 88 | |
| n | 88 | |
| D | 80 | |
| s | 80 | |
| o | 24 | 3.7% |
| L | 8 | 1.2% |
| c | 8 | 1.2% |
| r | 8 | 1.2% |
| Other values (3) | 24 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 656 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 160 | |
| i | 88 | |
| a | 88 | |
| n | 88 | |
| D | 80 | |
| s | 80 | |
| o | 24 | 3.7% |
| L | 8 | 1.2% |
| c | 8 | 1.2% |
| r | 8 | 1.2% |
| Other values (3) | 24 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 656 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 160 | |
| i | 88 | |
| a | 88 | |
| n | 88 | |
| D | 80 | |
| s | 80 | |
| o | 24 | 3.7% |
| L | 8 | 1.2% |
| c | 8 | 1.2% |
| r | 8 | 1.2% |
| Other values (3) | 24 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 656 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 160 | |
| i | 88 | |
| a | 88 | |
| n | 88 | |
| D | 80 | |
| s | 80 | |
| o | 24 | 3.7% |
| L | 8 | 1.2% |
| c | 8 | 1.2% |
| r | 8 | 1.2% |
| Other values (3) | 24 | 3.7% |
Death
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 236 | |
| 1 | 110 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 236 | |
| 1 | 110 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 236 | |
| 1 | 110 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 236 | |
| 1 | 110 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 236 | |
| 1 | 110 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 236 | |
| 1 | 110 |
Death_Cause
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.0 KiB |
| 0 | |
|---|---|
| 1 | |
| 3 | 23 |
| 9 | 13 |
| 2 | 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 235 | |
| 1 | 71 | 20.5% |
| 3 | 23 | 6.6% |
| 9 | 13 | 3.8% |
| 2 | 4 | 1.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 235 | |
| 1 | 71 | 20.5% |
| 3 | 23 | 6.6% |
| 9 | 13 | 3.8% |
| 2 | 4 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 235 | |
| 1 | 71 | 20.5% |
| 3 | 23 | 6.6% |
| 9 | 13 | 3.8% |
| 2 | 4 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 235 | |
| 1 | 71 | 20.5% |
| 3 | 23 | 6.6% |
| 9 | 13 | 3.8% |
| 2 | 4 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 235 | |
| 1 | 71 | 20.5% |
| 3 | 23 | 6.6% |
| 9 | 13 | 3.8% |
| 2 | 4 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 346 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 235 | |
| 1 | 71 | 20.5% |
| 3 | 23 | 6.6% |
| 9 | 13 | 3.8% |
| 2 | 4 | 1.2% |
DFS_Months
Real number (ℝ)
High correlation
| Distinct | 326 |
|---|---|
| Distinct (%) | 94.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.992842 |
| Minimum | 0.53 |
|---|---|
| Maximum | 87.37 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 0.53 |
|---|---|
| 5-th percentile | 4.285 |
| Q1 | 16.108333 |
| median | 35.38 |
| Q3 | 52.3525 |
| 95-th percentile | 74.1325 |
| Maximum | 87.37 |
| Range | 86.84 |
| Interquartile range (IQR) | 36.244167 |
Descriptive statistics
| Standard deviation | 22.358127 |
|---|---|
| Coefficient of variation (CV) | 0.62118259 |
| Kurtosis | -0.87069922 |
| Mean | 35.992842 |
| Median Absolute Deviation (MAD) | 18.701667 |
| Skewness | 0.22572253 |
| Sum | 12453.523 |
| Variance | 499.88584 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39.73 | 3 | 0.9% |
| 42.27 | 2 | 0.6% |
| 2.133333333 | 2 | 0.6% |
| 7.266666667 | 2 | 0.6% |
| 62.97 | 2 | 0.6% |
| 43.4 | 2 | 0.6% |
| 3.9 | 2 | 0.6% |
| 61.33 | 2 | 0.6% |
| 60.03 | 2 | 0.6% |
| 62.43 | 2 | 0.6% |
| Other values (316) | 325 |
| Value | Count | Frequency (%) |
| 0.53 | 1 | |
| 0.8 | 1 | |
| 1.03 | 1 | |
| 1.17 | 1 | |
| 1.37 | 1 | |
| 1.63 | 1 | |
| 2.133333333 | 2 | |
| 2.27 | 1 | |
| 2.5 | 1 | |
| 2.7 | 1 |
| Value | Count | Frequency (%) |
| 87.37 | 1 | |
| 86.83 | 1 | |
| 86.27 | 1 | |
| 85.93 | 2 | |
| 84.33 | 1 | |
| 83.4 | 1 | |
| 82.93 | 1 | |
| 82.53 | 1 | |
| 81.67 | 1 | |
| 77.83 | 1 |
OS_Months
Real number (ℝ)
High correlation
| Distinct | 321 |
|---|---|
| Distinct (%) | 92.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.032023 |
| Minimum | 0.53 |
|---|---|
| Maximum | 88.63 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 0.53 |
|---|---|
| 5-th percentile | 7.5775 |
| Q1 | 26.54 |
| median | 40.115 |
| Q3 | 57.865 |
| 95-th percentile | 74.9575 |
| Maximum | 88.63 |
| Range | 88.1 |
| Interquartile range (IQR) | 31.325 |
Descriptive statistics
| Standard deviation | 20.548568 |
|---|---|
| Coefficient of variation (CV) | 0.50079343 |
| Kurtosis | -0.63872963 |
| Mean | 41.032023 |
| Median Absolute Deviation (MAD) | 14.7 |
| Skewness | 0.10925279 |
| Sum | 14197.08 |
| Variance | 422.24363 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39.73 | 3 | 0.9% |
| 43.33 | 3 | 0.9% |
| 21.63 | 3 | 0.9% |
| 26 | 2 | 0.6% |
| 42.27 | 2 | 0.6% |
| 19.7 | 2 | 0.6% |
| 43.4 | 2 | 0.6% |
| 58.33 | 2 | 0.6% |
| 60.03 | 2 | 0.6% |
| 62.97 | 2 | 0.6% |
| Other values (311) | 323 |
| Value | Count | Frequency (%) |
| 0.53 | 1 | |
| 0.8 | 1 | |
| 1.03 | 1 | |
| 1.17 | 1 | |
| 1.37 | 1 | |
| 1.63 | 1 | |
| 2.27 | 1 | |
| 2.97 | 1 | |
| 3.6 | 1 | |
| 4.17 | 1 |
| Value | Count | Frequency (%) |
| 88.63 | 1 | |
| 87.37 | 1 | |
| 86.83 | 1 | |
| 86.27 | 1 | |
| 85.93 | 2 | |
| 84.33 | 1 | |
| 83.4 | 1 | |
| 82.93 | 1 | |
| 82.53 | 1 | |
| 81.67 | 1 |
Visiting_Staff
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9942197 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.9080215 |
|---|---|
| Coefficient of variation (CV) | 0.63723499 |
| Kurtosis | -1.1608759 |
| Mean | 2.9942197 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.41122301 |
| Sum | 1036 |
| Variance | 3.6405462 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 127 | |
| 5 | 61 | |
| 4 | 43 | 12.4% |
| 3 | 41 | 11.8% |
| 2 | 37 | 10.7% |
| 6 | 24 | 6.9% |
| 7 | 13 | 3.8% |
| Value | Count | Frequency (%) |
| 1 | 127 | |
| 2 | 37 | 10.7% |
| 3 | 41 | 11.8% |
| 4 | 43 | 12.4% |
| 5 | 61 | |
| 6 | 24 | 6.9% |
| 7 | 13 | 3.8% |
| Value | Count | Frequency (%) |
| 7 | 13 | 3.8% |
| 6 | 24 | 6.9% |
| 5 | 61 | |
| 4 | 43 | 12.4% |
| 3 | 41 | 11.8% |
| 2 | 37 | 10.7% |
| 1 | 127 |
Interactions
Correlations
| AJCC_Substage | Age | BMI | CEA_PreOp | Chart_No | DFS_Months | Death | Death_Cause | Differentiation | Dx_Year | ECOG | Histology | LNR | LN_Positive | LN_Total | LVI | Log_CEA_PreOp | MSI_Status | Mucinous_Any | Mucinous_Gt_50 | OS_Months | Op_Procedure | PNI | Patient_ID | PreOp_Albumin | Recurrence | Recurrence_Type | Sex | Tumor_Deposits | Tumor_Location | Tumor_Location_Group | Tumor_Size_cm | Visiting_Staff | pN_Stage | pT_Stage | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AJCC_Substage | 1.000 | 0.144 | 0.000 | 0.093 | 0.000 | 0.185 | 0.247 | 0.227 | 0.146 | 0.000 | 0.038 | 0.117 | 0.429 | 0.563 | 0.015 | 0.215 | 0.165 | 0.000 | 0.000 | 0.059 | 0.119 | 0.059 | 0.175 | 0.070 | 0.183 | 0.300 | 0.000 | 0.000 | 0.079 | 0.137 | 0.127 | 0.382 | 0.000 | 0.597 | 0.718 |
| Age | 0.144 | 1.000 | -0.166 | -0.012 | -0.240 | -0.225 | 0.336 | 0.190 | 0.085 | 0.115 | 0.318 | 0.070 | 0.090 | 0.018 | -0.225 | 0.000 | -0.012 | 0.000 | 0.144 | 0.159 | -0.282 | 0.121 | 0.000 | -0.240 | -0.383 | 0.000 | 0.000 | 0.065 | 0.000 | -0.063 | 0.110 | 0.064 | 0.074 | 0.000 | 0.095 |
| BMI | 0.000 | -0.166 | 1.000 | 0.014 | 0.036 | 0.180 | 0.086 | 0.057 | 0.000 | 0.000 | 0.081 | 0.000 | -0.092 | -0.093 | -0.053 | 0.013 | 0.014 | 0.108 | 0.000 | 0.000 | 0.204 | 0.000 | 0.131 | 0.036 | 0.092 | 0.000 | 0.000 | 0.093 | 0.145 | 0.019 | 0.081 | 0.020 | -0.010 | 0.101 | 0.000 |
| CEA_PreOp | 0.093 | -0.012 | 0.014 | 1.000 | 0.175 | 0.401 | 0.014 | 0.058 | 0.037 | 0.015 | 0.000 | 0.000 | -0.131 | -0.140 | -0.017 | 0.008 | 1.000 | 0.174 | 0.000 | 0.000 | 0.165 | 0.000 | 0.000 | 0.175 | 0.042 | 0.000 | 1.000 | 0.035 | 0.000 | 0.091 | 0.000 | -0.105 | -0.037 | 0.044 | 0.129 |
| Chart_No | 0.000 | -0.240 | 0.036 | 0.175 | 1.000 | -0.069 | 0.137 | 0.117 | 0.050 | 0.249 | 0.204 | 0.086 | -0.025 | 0.007 | 0.084 | 0.000 | 0.175 | 0.000 | 0.000 | 0.073 | -0.083 | 0.000 | 0.138 | 1.000 | 0.053 | 0.129 | 0.345 | 0.171 | 0.013 | 0.071 | 0.091 | 0.141 | -0.171 | 0.133 | 0.038 |
| DFS_Months | 0.185 | -0.225 | 0.180 | 0.401 | -0.069 | 1.000 | 0.672 | 0.349 | 0.016 | 0.487 | 0.117 | 0.000 | -0.148 | -0.148 | 0.040 | 0.112 | 0.401 | 0.000 | 0.000 | 0.000 | 0.871 | 0.083 | 0.081 | -0.069 | 0.224 | 0.610 | 0.233 | 0.000 | 0.000 | 0.120 | 0.150 | -0.194 | -0.125 | 0.153 | 0.117 |
| Death | 0.247 | 0.336 | 0.086 | 0.014 | 0.137 | 0.672 | 1.000 | 0.989 | 0.139 | 0.000 | 0.326 | 0.032 | 0.306 | 0.241 | 0.000 | 0.093 | 0.384 | 0.055 | 0.000 | 0.000 | 0.561 | 0.213 | 0.047 | 0.121 | 0.331 | 0.547 | 0.000 | 0.000 | 0.000 | 0.193 | 0.184 | 0.122 | 0.169 | 0.276 | 0.182 |
| Death_Cause | 0.227 | 0.190 | 0.057 | 0.058 | 0.117 | 0.349 | 0.989 | 1.000 | 0.028 | 0.000 | 0.275 | 0.000 | 0.243 | 0.145 | 0.162 | 0.094 | 0.235 | 0.079 | 0.000 | 0.000 | 0.286 | 0.128 | 0.000 | 0.143 | 0.176 | 0.652 | 0.000 | 0.000 | 0.000 | 0.172 | 0.251 | 0.017 | 0.080 | 0.229 | 0.126 |
| Differentiation | 0.146 | 0.085 | 0.000 | 0.037 | 0.050 | 0.016 | 0.139 | 0.028 | 1.000 | 0.036 | 0.131 | 0.320 | 0.070 | 0.083 | 0.000 | 0.081 | 0.000 | 0.255 | 0.314 | 0.444 | 0.082 | 0.508 | 0.000 | 0.089 | 0.158 | 0.201 | 0.000 | 0.000 | 0.045 | 0.065 | 0.136 | 0.255 | 0.000 | 0.138 | 0.056 |
| Dx_Year | 0.000 | 0.115 | 0.000 | 0.015 | 0.249 | 0.487 | 0.000 | 0.000 | 0.036 | 1.000 | 0.059 | 0.000 | 0.019 | 0.000 | 0.000 | 0.149 | 0.137 | 0.000 | 0.000 | 0.000 | 0.502 | 0.154 | 0.000 | 0.504 | 0.000 | 0.150 | 0.056 | 0.000 | 0.000 | 0.056 | 0.000 | 0.049 | 0.141 | 0.124 | 0.000 |
| ECOG | 0.038 | 0.318 | 0.081 | 0.000 | 0.204 | 0.117 | 0.326 | 0.275 | 0.131 | 0.059 | 1.000 | 0.138 | 0.158 | 0.051 | 0.000 | 0.147 | 0.000 | 0.000 | 0.111 | 0.173 | 0.141 | 0.219 | 0.000 | 0.153 | 0.233 | 0.136 | 0.000 | 0.067 | 0.000 | 0.000 | 0.115 | 0.116 | 0.411 | 0.000 | 0.085 |
| Histology | 0.117 | 0.070 | 0.000 | 0.000 | 0.086 | 0.000 | 0.032 | 0.000 | 0.320 | 0.000 | 0.138 | 1.000 | 0.226 | 0.163 | 0.000 | 0.000 | 0.000 | 0.068 | 0.806 | 0.748 | 0.000 | 0.179 | 0.000 | 0.000 | 0.135 | 0.075 | 0.000 | 0.037 | 0.000 | 0.060 | 0.093 | 0.273 | 0.128 | 0.163 | 0.000 |
| LNR | 0.429 | 0.090 | -0.092 | -0.131 | -0.025 | -0.148 | 0.306 | 0.243 | 0.070 | 0.019 | 0.158 | 0.226 | 1.000 | 0.927 | -0.210 | 0.175 | -0.131 | 0.031 | 0.134 | 0.035 | -0.125 | 0.000 | 0.106 | -0.025 | 0.006 | 0.279 | 0.000 | 0.000 | 0.220 | -0.069 | 0.000 | 0.078 | 0.080 | 0.790 | 0.063 |
| LN_Positive | 0.563 | 0.018 | -0.093 | -0.140 | 0.007 | -0.148 | 0.241 | 0.145 | 0.083 | 0.000 | 0.051 | 0.163 | 0.927 | 1.000 | 0.142 | 0.177 | -0.140 | 0.000 | 0.101 | 0.000 | -0.116 | 0.000 | 0.183 | 0.007 | -0.038 | 0.245 | 0.000 | 0.000 | 0.118 | -0.101 | 0.000 | 0.163 | 0.032 | 0.696 | 0.137 |
| LN_Total | 0.015 | -0.225 | -0.053 | -0.017 | 0.084 | 0.040 | 0.000 | 0.162 | 0.000 | 0.000 | 0.000 | 0.000 | -0.210 | 0.142 | 1.000 | 0.035 | -0.017 | 0.077 | 0.000 | 0.000 | 0.045 | 0.000 | 0.000 | 0.084 | -0.069 | 0.000 | 0.047 | 0.098 | 0.000 | -0.070 | 0.124 | 0.226 | -0.110 | 0.000 | 0.099 |
| LVI | 0.215 | 0.000 | 0.013 | 0.008 | 0.000 | 0.112 | 0.093 | 0.094 | 0.081 | 0.149 | 0.147 | 0.000 | 0.175 | 0.177 | 0.035 | 1.000 | 0.075 | 0.000 | 0.014 | 0.060 | 0.079 | 0.000 | 0.142 | 0.064 | 0.073 | 0.157 | 0.000 | 0.016 | 0.079 | 0.086 | 0.000 | 0.071 | 0.000 | 0.233 | 0.163 |
| Log_CEA_PreOp | 0.165 | -0.012 | 0.014 | 1.000 | 0.175 | 0.401 | 0.384 | 0.235 | 0.000 | 0.137 | 0.000 | 0.000 | -0.131 | -0.140 | -0.017 | 0.075 | 1.000 | 0.162 | 0.000 | 0.000 | 0.165 | 0.108 | 0.056 | 0.175 | 0.042 | 0.759 | 1.000 | 0.158 | 0.000 | 0.091 | 0.036 | -0.105 | -0.037 | 0.111 | 0.147 |
| MSI_Status | 0.000 | 0.000 | 0.108 | 0.174 | 0.000 | 0.000 | 0.055 | 0.079 | 0.255 | 0.000 | 0.000 | 0.068 | 0.031 | 0.000 | 0.077 | 0.000 | 0.162 | 1.000 | 0.212 | 0.137 | 0.000 | 0.306 | 0.000 | 0.000 | 0.109 | 0.000 | 0.061 | 0.086 | 0.000 | 0.323 | 0.239 | 0.386 | 0.000 | 0.130 | 0.072 |
| Mucinous_Any | 0.000 | 0.144 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.314 | 0.000 | 0.111 | 0.806 | 0.134 | 0.101 | 0.000 | 0.014 | 0.000 | 0.212 | 1.000 | 0.708 | 0.000 | 0.239 | 0.000 | 0.000 | 0.183 | 0.000 | 0.000 | 0.000 | 0.000 | 0.205 | 0.150 | 0.220 | 0.112 | 0.150 | 0.000 |
| Mucinous_Gt_50 | 0.059 | 0.159 | 0.000 | 0.000 | 0.073 | 0.000 | 0.000 | 0.000 | 0.444 | 0.000 | 0.173 | 0.748 | 0.035 | 0.000 | 0.000 | 0.060 | 0.000 | 0.137 | 0.708 | 1.000 | 0.000 | 0.250 | 0.000 | 0.000 | 0.215 | 0.000 | 0.000 | 0.000 | 0.000 | 0.179 | 0.121 | 0.196 | 0.172 | 0.102 | 0.079 |
| OS_Months | 0.119 | -0.282 | 0.204 | 0.165 | -0.083 | 0.871 | 0.561 | 0.286 | 0.082 | 0.502 | 0.141 | 0.000 | -0.125 | -0.116 | 0.045 | 0.079 | 0.165 | 0.000 | 0.000 | 0.000 | 1.000 | 0.124 | 0.000 | -0.083 | 0.266 | 0.252 | 0.000 | 0.077 | 0.000 | 0.092 | 0.067 | -0.190 | -0.170 | 0.215 | 0.037 |
| Op_Procedure | 0.059 | 0.121 | 0.000 | 0.000 | 0.000 | 0.083 | 0.213 | 0.128 | 0.508 | 0.154 | 0.219 | 0.179 | 0.000 | 0.000 | 0.000 | 0.000 | 0.108 | 0.306 | 0.239 | 0.250 | 0.124 | 1.000 | 0.057 | 0.066 | 0.082 | 0.079 | 0.000 | 0.072 | 0.000 | 0.614 | 0.911 | 0.205 | 0.167 | 0.094 | 0.131 |
| PNI | 0.175 | 0.000 | 0.131 | 0.000 | 0.138 | 0.081 | 0.047 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.106 | 0.183 | 0.000 | 0.142 | 0.056 | 0.000 | 0.000 | 0.000 | 0.000 | 0.057 | 1.000 | 0.077 | 0.000 | 0.212 | 0.064 | 0.000 | 0.000 | 0.154 | 0.148 | 0.156 | 0.054 | 0.167 | 0.115 |
| Patient_ID | 0.070 | -0.240 | 0.036 | 0.175 | 1.000 | -0.069 | 0.121 | 0.143 | 0.089 | 0.504 | 0.153 | 0.000 | -0.025 | 0.007 | 0.084 | 0.064 | 0.175 | 0.000 | 0.000 | 0.000 | -0.083 | 0.066 | 0.077 | 1.000 | 0.053 | 0.201 | 0.487 | 0.185 | 0.000 | 0.071 | 0.000 | 0.141 | -0.171 | 0.183 | 0.048 |
| PreOp_Albumin | 0.183 | -0.383 | 0.092 | 0.042 | 0.053 | 0.224 | 0.331 | 0.176 | 0.158 | 0.000 | 0.233 | 0.135 | 0.006 | -0.038 | -0.069 | 0.073 | 0.042 | 0.109 | 0.183 | 0.215 | 0.266 | 0.082 | 0.000 | 0.053 | 1.000 | 0.123 | 0.000 | 0.059 | 0.021 | 0.223 | 0.223 | -0.391 | -0.245 | 0.000 | 0.199 |
| Recurrence | 0.300 | 0.000 | 0.000 | 0.000 | 0.129 | 0.610 | 0.547 | 0.652 | 0.201 | 0.150 | 0.136 | 0.075 | 0.279 | 0.245 | 0.000 | 0.157 | 0.759 | 0.000 | 0.000 | 0.000 | 0.252 | 0.079 | 0.212 | 0.201 | 0.123 | 1.000 | 1.000 | 0.000 | 0.000 | 0.155 | 0.086 | 0.107 | 0.000 | 0.257 | 0.234 |
| Recurrence_Type | 0.000 | 0.000 | 0.000 | 1.000 | 0.345 | 0.233 | 0.000 | 0.000 | 0.000 | 0.056 | 0.000 | 0.000 | 0.000 | 0.000 | 0.047 | 0.000 | 1.000 | 0.061 | 0.000 | 0.000 | 0.000 | 0.000 | 0.064 | 0.487 | 0.000 | 1.000 | 1.000 | 0.134 | 0.144 | 0.000 | 0.000 | 0.390 | 0.249 | 0.000 | 0.000 |
| Sex | 0.000 | 0.065 | 0.093 | 0.035 | 0.171 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.067 | 0.037 | 0.000 | 0.000 | 0.098 | 0.016 | 0.158 | 0.086 | 0.000 | 0.000 | 0.077 | 0.072 | 0.000 | 0.185 | 0.059 | 0.000 | 0.134 | 1.000 | 0.031 | 0.095 | 0.096 | 0.143 | 0.032 | 0.000 | 0.135 |
| Tumor_Deposits | 0.079 | 0.000 | 0.145 | 0.000 | 0.013 | 0.000 | 0.000 | 0.000 | 0.045 | 0.000 | 0.000 | 0.000 | 0.220 | 0.118 | 0.000 | 0.079 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.021 | 0.000 | 0.144 | 0.031 | 1.000 | 0.000 | 0.007 | 0.000 | 0.000 | 1.000 | 0.000 |
| Tumor_Location | 0.137 | -0.063 | 0.019 | 0.091 | 0.071 | 0.120 | 0.193 | 0.172 | 0.065 | 0.056 | 0.000 | 0.060 | -0.069 | -0.101 | -0.070 | 0.086 | 0.091 | 0.323 | 0.205 | 0.179 | 0.092 | 0.614 | 0.154 | 0.071 | 0.223 | 0.155 | 0.000 | 0.095 | 0.000 | 1.000 | 0.991 | -0.331 | 0.002 | 0.148 | 0.038 |
| Tumor_Location_Group | 0.127 | 0.110 | 0.081 | 0.000 | 0.091 | 0.150 | 0.184 | 0.251 | 0.136 | 0.000 | 0.115 | 0.093 | 0.000 | 0.000 | 0.124 | 0.000 | 0.036 | 0.239 | 0.150 | 0.121 | 0.067 | 0.911 | 0.148 | 0.000 | 0.223 | 0.086 | 0.000 | 0.096 | 0.007 | 0.991 | 1.000 | 0.306 | 0.000 | 0.000 | 0.153 |
| Tumor_Size_cm | 0.382 | 0.064 | 0.020 | -0.105 | 0.141 | -0.194 | 0.122 | 0.017 | 0.255 | 0.049 | 0.116 | 0.273 | 0.078 | 0.163 | 0.226 | 0.071 | -0.105 | 0.386 | 0.220 | 0.196 | -0.190 | 0.205 | 0.156 | 0.141 | -0.391 | 0.107 | 0.390 | 0.143 | 0.000 | -0.331 | 0.306 | 1.000 | 0.077 | 0.104 | 0.420 |
| Visiting_Staff | 0.000 | 0.074 | -0.010 | -0.037 | -0.171 | -0.125 | 0.169 | 0.080 | 0.000 | 0.141 | 0.411 | 0.128 | 0.080 | 0.032 | -0.110 | 0.000 | -0.037 | 0.000 | 0.112 | 0.172 | -0.170 | 0.167 | 0.054 | -0.171 | -0.245 | 0.000 | 0.249 | 0.032 | 0.000 | 0.002 | 0.000 | 0.077 | 1.000 | 0.000 | 0.000 |
| pN_Stage | 0.597 | 0.000 | 0.101 | 0.044 | 0.133 | 0.153 | 0.276 | 0.229 | 0.138 | 0.124 | 0.000 | 0.163 | 0.790 | 0.696 | 0.000 | 0.233 | 0.111 | 0.130 | 0.150 | 0.102 | 0.215 | 0.094 | 0.167 | 0.183 | 0.000 | 0.257 | 0.000 | 0.000 | 1.000 | 0.148 | 0.000 | 0.104 | 0.000 | 1.000 | 0.108 |
| pT_Stage | 0.718 | 0.095 | 0.000 | 0.129 | 0.038 | 0.117 | 0.182 | 0.126 | 0.056 | 0.000 | 0.085 | 0.000 | 0.063 | 0.137 | 0.099 | 0.163 | 0.147 | 0.072 | 0.000 | 0.079 | 0.037 | 0.131 | 0.115 | 0.048 | 0.199 | 0.234 | 0.000 | 0.135 | 0.000 | 0.038 | 0.153 | 0.420 | 0.000 | 0.108 | 1.000 |
Missing values
Sample
| Patient_ID | Chart_No | Dx_Date | Dx_Year | Age | Sex | BMI | ECOG | Tumor_Location | Tumor_Location_Group | pT_Stage | pN_Stage | AJCC_Substage | LN_Total | LN_Positive | LNR | Histology | Differentiation | LVI | PNI | Tumor_Deposits | Mucinous_Gt_50 | Mucinous_Any | Signet_Ring | MSI_Status | Tumor_Size_cm | CEA_PreOp | Log_CEA_PreOp | Radical_Op_Date | Op_Procedure | PreOp_Albumin | Last_FU_Date | Recurrence | Recurrence_Date | Recurrence_Type | Death | Death_Cause | DFS_Months | OS_Months | Visiting_Staff | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 170832 | 2017-10-16 | 2017 | 77 | 2 | 24.24 | 2.0 | 4 | 1 | 4A | 2B | 3C | 17 | 12 | 0.705882 | 2 | 3 | 1.0 | 1.0 | 0 | 1 | 1 | NaN | MSS | 3.5 | 0.5 | 0.405465 | 2017-10-20 | Laparoscopic right hemicolectomy | 3.6 | 2020-03-15 | 1 | 2019-04-08 | Locoregional | 1 | 3 | 17.833333 | 29.57 | 4 |
| 1 | 2 | 190783 | 2017-06-20 | 2017 | 82 | 1 | 22.23 | 1.0 | 7 | 2 | 3 | 1B | 3B | 34 | 2 | 0.058824 | 1 | 2 | 0.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 4.0 | 2.1 | 1.131402 | 2017-07-07 | Anterior resection | NaN | 2024-03-20 | 0 | NaT | NaN | 0 | 0 | 82.530000 | 82.53 | 2 |
| 2 | 3 | 335615 | 2021-05-15 | 2021 | 71 | 1 | 30.12 | 1.0 | 7 | 2 | 3 | NaN | 3B | 15 | 0 | 0.000000 | 1 | 1 | 1.0 | 0.0 | 1 | 0 | 0 | NaN | MSS | 2.6 | 2.1 | 1.131402 | 2021-06-07 | Laparoscopic anterior resection | 3.3 | 2024-05-01 | 0 | NaT | NaN | 0 | 0 | 36.070000 | 36.07 | 1 |
| 3 | 4 | 458173 | 2021-11-04 | 2021 | 87 | 2 | 25.11 | 1.0 | 2 | 1 | 4A | 1B | 3B | 17 | 3 | 0.176471 | 1 | 3 | 0.0 | 1.0 | 0 | 0 | 1 | NaN | MSI-H | 9.2 | 2.1 | 1.131402 | 2021-11-22 | Laparoscopic right hemicolectomy | 3.8 | 2024-02-29 | 0 | NaT | NaN | 0 | 0 | 28.230000 | 28.23 | 1 |
| 4 | 5 | 536710 | 2020-07-17 | 2020 | 90 | 1 | 23.39 | 2.0 | 7 | 2 | 3 | 1B | 3B | 18 | 3 | 0.166667 | 2 | 4 | 1.0 | 0.0 | 0 | 1 | 1 | NaN | MSS | 7.0 | 2.1 | 1.131402 | 2020-07-24 | Laparoscopic anterior resection | 3.2 | 2020-08-19 | 0 | NaT | NaN | 1 | 1 | 1.170000 | 1.17 | 5 |
| 5 | 6 | 545620 | 2021-07-29 | 2021 | 87 | 1 | 20.73 | 1.0 | 2 | 1 | 4A | 1B | 3B | 23 | 2 | 0.086957 | 1 | 2 | 1.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 7.2 | 0.5 | 0.405465 | 2021-08-16 | Laparoscopic right hemicolectomy | 3.5 | 2023-06-26 | 1 | 2023-03-13 | Distant | 1 | 1 | 19.133333 | 23.47 | 1 |
| 6 | 7 | 657589 | 2017-10-16 | 2017 | 71 | 2 | 18.89 | 0.0 | 7 | 2 | 1 | 1B | 3A | 34 | 3 | 0.088235 | 1 | 2 | 0.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 0.9 | 2.2 | 1.163151 | 2017-11-30 | Laparoscopic anterior resection | 3.6 | 2019-09-11 | 0 | NaT | NaN | 1 | 3 | 23.170000 | 23.17 | 3 |
| 7 | 8 | 706865 | 2018-11-26 | 2018 | 64 | 1 | 29.94 | NaN | 7 | 2 | 3 | 1B | 3B | 22 | 3 | 0.136364 | 1 | 2 | 0.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 3.2 | 2.2 | 1.163151 | 2018-12-18 | Laparoscopic anterior resection | NaN | 2024-05-02 | 0 | NaT | NaN | 0 | 0 | 66.130000 | 66.13 | 2 |
| 8 | 9 | 790078 | 2020-06-19 | 2020 | 83 | 2 | 22.03 | 2.0 | 7 | 2 | 4B | 1B | 3C | 21 | 2 | 0.095238 | 1 | 2 | 1.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 3.6 | 0.5 | 0.405465 | 2020-06-19 | Laparoscopic anterior resection | 3.5 | 2024-05-03 | 1 | 2023-06-06 | Distant | 0 | 0 | 36.066667 | 47.73 | 4 |
| 9 | 10 | 826302 | 2018-09-04 | 2018 | 84 | 2 | 32.51 | NaN | 4 | 1 | 3 | 1B | 3B | 13 | 2 | 0.153846 | 1 | 3 | 0.0 | 0.0 | 0 | 0 | 0 | NaN | MSI-H | 6.5 | 0.5 | 0.405465 | 2018-09-26 | Laparoscopic left hemicolectomy | 2.7 | 2019-01-07 | 1 | 2018-12-22 | Distant | 1 | 1 | 2.900000 | 4.17 | 5 |
| Patient_ID | Chart_No | Dx_Date | Dx_Year | Age | Sex | BMI | ECOG | Tumor_Location | Tumor_Location_Group | pT_Stage | pN_Stage | AJCC_Substage | LN_Total | LN_Positive | LNR | Histology | Differentiation | LVI | PNI | Tumor_Deposits | Mucinous_Gt_50 | Mucinous_Any | Signet_Ring | MSI_Status | Tumor_Size_cm | CEA_PreOp | Log_CEA_PreOp | Radical_Op_Date | Op_Procedure | PreOp_Albumin | Last_FU_Date | Recurrence | Recurrence_Date | Recurrence_Type | Death | Death_Cause | DFS_Months | OS_Months | Visiting_Staff | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 336 | 337 | 19070510 | 2021-10-12 | 2021 | 39 | 2 | 36.27 | 0.0 | 6 | 2 | 4A | NaN | 3C | 12 | 4 | 0.333333 | 1 | 2 | 0.0 | 1.0 | 0 | 0 | 0 | NaN | MSI-H | 5.4 | 2.1 | 1.131402 | 2021-10-27 | Laparoscopic left hemicolectomy | NaN | 2023-05-26 | 1 | 2021-12-30 | Distant | 1 | 1 | 2.133333 | 19.70 | 5 |
| 337 | 338 | 19114886 | 2021-10-06 | 2021 | 88 | 1 | 20.73 | 1.0 | 7 | 2 | 3 | 2B | 3C | 18 | 7 | 0.388889 | 1 | 2 | 1.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 3.2 | 6.8 | 2.054124 | 2021-10-28 | Laparoscopic anterior resection | NaN | 2024-04-08 | 0 | NaT | NaN | 0 | 0 | 30.700000 | 30.70 | 3 |
| 338 | 339 | 19127334 | 2021-09-22 | 2021 | 43 | 2 | 19.80 | 1.0 | 7 | 2 | 3 | 1B | 3B | 33 | 3 | 0.090909 | 1 | 2 | 1.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 3.0 | 2.1 | 1.131402 | 2021-10-15 | Anterior resection | NaN | 2024-04-16 | 0 | NaT | NaN | 0 | 0 | 31.230000 | 31.23 | 4 |
| 339 | 340 | 19161821 | 2021-10-11 | 2021 | 89 | 1 | 27.61 | 1.0 | 2 | 1 | 4A | NaN | 3B | 16 | 0 | 0.000000 | 1 | 2 | 0.0 | 0.0 | 1 | 0 | 0 | NaN | MSS | 6.5 | 4.6 | 1.722767 | 2021-10-25 | Laparoscopic right hemicolectomy | NaN | 2023-09-25 | 0 | NaT | NaN | 1 | 9 | 23.800000 | 23.80 | 1 |
| 340 | 341 | 19219706 | 2021-11-26 | 2021 | 54 | 1 | 27.24 | 1.0 | 8 | 2 | 3 | 1B | 3B | 15 | 3 | 0.200000 | 1 | 2 | 0.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 4.7 | 4.3 | 1.667707 | 2021-12-13 | Laparoscopic low anterior resection | 4.6 | 2024-04-16 | 0 | NaT | NaN | 0 | 0 | 29.070000 | 29.07 | 1 |
| 341 | 342 | 19234425 | 2021-11-23 | 2021 | 58 | 2 | 21.72 | 1.0 | 2 | 1 | 4A | 1A | 3B | 24 | 1 | 0.041667 | 1 | 3 | 1.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 3.5 | 2.1 | 1.131402 | 2021-12-03 | Laparoscopic right hemicolectomy | 3.6 | 2023-06-07 | 1 | 2022-06-16 | Distant | 1 | 1 | 6.500000 | 18.70 | 1 |
| 342 | 343 | 19244963 | 2021-11-25 | 2021 | 55 | 1 | 22.74 | 1.0 | 8 | 2 | 2 | 1A | 3A | 22 | 1 | 0.045455 | 1 | 2 | 0.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 2.9 | 1.3 | 0.832909 | 2022-01-03 | Laparoscopic low anterior resection | NaN | 2024-04-12 | 0 | NaT | NaN | 0 | 0 | 28.970000 | 28.97 | 1 |
| 343 | 344 | 19277828 | 2021-11-29 | 2021 | 62 | 1 | 25.10 | 1.0 | 7 | 2 | 2 | 1A | 3A | 22 | 1 | 0.045455 | 1 | 2 | 1.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 4.4 | 3.0 | 1.386294 | 2021-12-20 | Laparoscopic anterior resection | 4.5 | 2021-12-30 | 0 | NaT | NaN | 0 | 0 | 1.030000 | 1.03 | 1 |
| 344 | 345 | 19332242 | 2022-01-04 | 2021 | 60 | 2 | 25.35 | 1.0 | 7 | 2 | 3 | NaN | 3B | 20 | 0 | 0.000000 | 1 | 2 | 0.0 | 0.0 | 1 | 0 | 0 | NaN | MSS | 4.6 | 12.9 | 2.631889 | 2022-01-25 | Laparoscopic anterior resection | 3.2 | 2024-04-11 | 0 | NaT | NaN | 0 | 0 | 27.830000 | 27.83 | 6 |
| 345 | 346 | 19350595 | 2021-12-15 | 2021 | 50 | 2 | 19.33 | 0.0 | 8 | 2 | 4A | 1A | 3B | 14 | 1 | 0.071429 | 1 | 2 | 0.0 | 0.0 | 0 | 0 | 0 | NaN | MSS | 5.2 | 1.1 | 0.741937 | 2022-02-04 | Laparoscopic low anterior resection | 4.5 | 2024-02-21 | 0 | NaT | NaN | 0 | 0 | 26.600000 | 26.60 | 3 |